Graph Quality Judgement: A Large Margin Expedition
Authors
Abstract
Graphs, a common structure in machine learning, play an important role in many learning tasks such as graph-based semi-supervised learning (GSSL). The quality of the graph, however, strongly affects the performance of GSSL: an inappropriate graph may even degrade performance, so that GSSL using unlabeled data is outperformed by direct supervised learning with only labeled data. It is therefore desirable to judge the quality of a graph and to develop performance-safe GSSL methods. In this paper we propose LEAD, a large margin separation method for safe GSSL. Our basic idea is that if a graph is of high quality, its predictions on unlabeled data are likely to have a large margin separation. We should therefore exploit the large-margin graphs while rarely exploiting the small-margin graphs, which might be risky. Based on this observation, we formulate safe GSSL as a Semi-Supervised SVM (S3VM) optimization and present an efficient algorithm. Extensive experimental results demonstrate that the proposed method effectively improves the safeness of GSSL while also achieving accuracy that is highly competitive with many state-of-the-art GSSL methods.
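The margin intuition in the abstract can be illustrated with a small sketch. This is not the LEAD algorithm: in place of the S3VM formulation it uses plain harmonic label propagation, and it only compares the mean absolute prediction score on unlabeled points for two candidate graphs. The helper names (`knn_graph`, `harmonic_scores`, `mean_margin`), the toy data, and the regularization constant are all assumptions made for illustration.

```python
import numpy as np

def knn_graph(X, k):
    """Symmetric k-nearest-neighbour adjacency matrix with 0/1 weights."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)            # exclude self-neighbours
    idx = np.argsort(d, axis=1)[:, :k]     # k closest points per row
    W = np.zeros_like(d)
    W[np.repeat(np.arange(len(X)), k), idx.ravel()] = 1.0
    return np.maximum(W, W.T)              # symmetrise

def harmonic_scores(W, y_labeled, reg=1e-6):
    """Label propagation scores on unlabeled nodes (labeled nodes come first).

    A small ridge term `reg` (an assumption of this sketch) keeps the system
    solvable when some unlabeled nodes are not connected to any labeled node.
    """
    n_l = len(y_labeled)
    L = np.diag(W.sum(axis=1)) - W         # unnormalised graph Laplacian
    L_uu = L[n_l:, n_l:] + reg * np.eye(len(W) - n_l)
    L_ul = L[n_l:, :n_l]
    return np.linalg.solve(L_uu, -L_ul @ y_labeled)

def mean_margin(scores):
    """Average distance of the unlabeled predictions from the decision boundary."""
    return float(np.mean(np.abs(scores)))

# Toy data: two well-separated classes, three labeled points per class.
rng = np.random.default_rng(0)
pos = rng.normal(loc=+5.0, size=(30, 2))
neg = rng.normal(loc=-5.0, size=(30, 2))
X = np.vstack([pos[:3], neg[:3], pos[3:], neg[3:]])   # labeled rows first
y_labeled = np.array([1.0, 1.0, 1.0, -1.0, -1.0, -1.0])

W_good = knn_graph(X, k=6)                            # graph built from the real features
W_bad = knn_graph(rng.normal(size=X.shape), k=6)      # graph built from unrelated noise

margin_good = mean_margin(harmonic_scores(W_good, y_labeled))
margin_bad = mean_margin(harmonic_scores(W_bad, y_labeled))
print(f"good graph margin: {margin_good:.3f}, noisy graph margin: {margin_bad:.3f}")
```

On this toy data the graph built from the real features is expected to yield the larger margin, which is the signal the abstract proposes to exploit. How the paper actually combines the candidate graphs inside the S3VM objective is not described in the abstract and is not reproduced here.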
Similar resources
Two new species of Palapedia Ng, 1993 (Crustacea, Decapoda, Brachyura, Xanthidae) from the Persian Gulf.
Two new species of Palapedia Ng, 1993, are described based on material collected from Abu-Musa Island, Persian Gulf during the present study, from Bahrain by the 1937/38 Danish Expedition, and from the Saudi Arabian coast of the Persian Gulf by Michael Apel in 1992-1995. Palapedia persica n. sp. is distinguishable from its congeners by having distinctly large denticles on the upper margin of th...
Halogen and 129I systematics in gas hydrate fields at the northern Cascadia margin (IODP Expedition 311): Insights from numerical modeling
We measured halogen concentrations and 129I/I ratios in five drilling sites of Integrated Ocean Drilling Program Expedition 311 (offshore Vancouver Island, Canada) in order to identify potential sources of fluids and methane in gas hydrate fields. Iodine is dominated by organic decomposition and transports with fluids in reducing environments and the presence of the cosmogenic radioisotope 129I (...
Sampling from social networks’s graph based on topological properties and bee colony algorithm
In recent years, the sampling problem in massive graphs of social networks has attracted much attention for fast analyzing a small and good sample instead of a huge network. Many algorithms have been proposed for sampling social network graphs. The purpose of these algorithms is to create a sample that is approximately similar to the original network’s graph in terms of properties such as de...
Large Margin Boltzmann Machines and Large Margin Sigmoid Belief Networks
Current statistical models for structured prediction make simplifying assumptions about the underlying output graph structure, such as assuming a low-order Markov chain, because exact inference becomes intractable as the tree-width of the underlying graph increases. Approximate inference algorithms, on the other hand, force one to trade off representational power with computational efficiency. ...
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...